Answer Retrieval From Extracted Tables
نویسندگان
چکیده
Question answering (QA) on table data, which contains densely packed information in two-dimensional form, is a challenging information retrieval task. Data can be placed at a distance from the metadata describing it. The metadata itself can be difficult to identify given the layout of a particular table. This paper describes a QA system for tables created with both machine learning and heuristic table extraction methods. Our approach creates a cell document for each table cell. A probabilistic language model selects the most likely cell documents for the information need. The performance of the system is tested with government statistical data, and errors are analyzed in order to improve the system. We also apply these improvements on another type of table data set and show the experimental results.
منابع مشابه
Statistical Machine Translation for Query Expansion in Answer Retrieval
We present an approach to query expansion in answer retrieval that uses Statistical Machine Translation (SMT) techniques to bridge the lexical gap between questions and answers. SMT-based query expansion is done by i) using a full-sentence paraphraser to introduce synonyms in context of the entire query, and ii) by translating query terms into answer terms using a full-sentence SMT model traine...
متن کاملImproving Text Retrieval Precision and Answer Accuracy in Question Answering Systems
Question Answering (QA) systems are often built modularly, with a text retrieval component feeding forward into an answer extraction component. Conventional wisdom suggests that, the higher the quality of the retrieval results used as input to the answer extraction module, the better the extracted answers, and hence system accuracy, will be. This turns out to be a poor assumption, because text ...
متن کاملColing 2008 22 nd International Conference on Computational Linguistics
Question Answering (QA) systems are often built modularly, with a text retrieval component feeding forward into an answer extraction component. Conventional wisdom suggests that, the higher the quality of the retrieval results used as input to the answer extraction module, the better the extracted answers, and hence system accuracy, will be. This turns out to be a poor assumption, because text ...
متن کاملAnswer Passage Retrieval for Question Answering
Document or passage retrieval is typically used as the first step in current question answering systems. The accuracy of the answer that is extracted from the passages and the efficiency of the question answering process will depend to some extent on the quality of this initial ranking. We show how language model approaches can be used to improve answer passage ranking. In particular, we show h...
متن کاملAttribute Retrieval from Relational Web Tables
In this paper, we propose an attribute retrieval approach which extracts and ranks attributes from HTML tables. Given an instance (e.g. Tower of Pisa), we want to retrieve from the Web its attributes (e.g. height, architect). Our approach uses HTML tables which are probably the largest source for attribute retrieval. Three recall oriented filters are applied over tables to check the following t...
متن کامل